The tail empirical process for long memory 
stochastic volatility sequences 

Rafal Kulik* Philippe Soulier^ 
September 16, 2010 



Abstract 

This paper describes the limiting behaviour of tail empirical pro- 
cesses associated with long memory stochastic volatility models. We 
show that such a process has dichotomous behaviour, according to an 
interplay between the Hurst parameter and the tail index. On the 
other hand, the tail empirical process with random levels never suffers 
from long memory. This is very desirable from a practical point of 
view, since such a process may be used to construct the Hill estimator 
of the tail index. To prove our results we need to establish new re- 
sults for regularly varying distributions, which may be of independent 
interest. 



1 Introduction 



The goal of this article is to study weak convergence results for the tail 
empirical process associated with some long memory sequences. Besides of 
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theoretical interests on its own, the results are applicable in different statis- 
tical procedures based on several extremes. A similar problem was studied 
in case of independent, identically distributed random variables in [12], or 
for weakly dependent sequences in [11], [10], [9], [19]. 

Our set-up is as follows. Assume that {Aj , i £ Z}, is a stationary 
Gaussian process with unit variance and covariance 

Pi -j = cov(Xj, Xj) = \i-j\ 2H - 2 U\i-j\) , (1) 

where H £ (1/2, 1) is the Hurst exponent and £q is a slowly varying function 
at infinity, i.e. lim^^ £ (tx) / £ (x) = 1 f° r an x > 0. The sequence in this 
case is referred to as an LRD Gaussian sequence. We also consider weakly 
dependent Gaussian sequences, i.e. such that Y^jLi l cov (Ai, Aj + i)| < oo. 

We shall consider a stochastic volatility process defined as 

Y t = a(Xi)Z h % £ Z, 

where a(-) is a nonnegative, deterministic function and that {Z,Z iy i £ Z}, 
is a sequence of i.i.d. random variables, independent of the process {A;}. 
We note, in particular, that if E[Z 2 ] < oo and K[Z] = 0, then the Y^s are 
uncorrelated, no matter the assumptions on dependence structure of the 
underlying Gaussian sequence. 

Stochastic volatility models have become popular in financial time series 
modeling. In particular, if if £ (1/2,1), these models are believed to cap- 
ture two standardized features of financial data: long memory of squares or 
absolute values, and conditional heteroscedascity. If a(x) = exp(rr), then 
the model is referred to in the econometrics literature as Long Memory in 
Stochastic Volatility (LMSV) and was introduced in [ i]. For an overview of 
stochastic volatility models with long memory we refer to [7]. 

Let F — Fi, % > 1, be the marginal distribution of Yj. We want to 
consider the case where F belongs to the domain of attraction of an extreme 
value distribution with positive index 7, i.e. there exist sequences u n , n > 1, 
u n — > 00, and o~ n , n > 1, such that the associated conditional tail distribution 
function 

T n {x) = F{ * u " + a f\ x > 0, n > 1, (2) 

satisfies 

lim T n (x) = T(x) = (1 + x)~ lh , x > . (3) 
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For the stochastic volatility model, this will be obtained through a further 
specification. Let Fz be the marginal distribution of the noise sequence. We 
will assume that for some a G (0, oo), 

F z (z) = P(Z >x) = x- a £(x) , (4) 

where i is again a slowly varying function. Assuming (4) and E[cr Q+<: (Xi)] < 
oo for some e > 0, we conclude by Breiman's Lemma [5] (see also [18, Propo- 
sition 7.5]) that 

F{x) = ¥(Y 1 >x)= F(a(X 1 )Z 1 > x) ~ E[a a (X 1 )]F(Z 1 > x) , as x -»■ oo. 

Consequently, F(-) satisfies (3) with a n = u n and 7=1/0;. 

Similarly to [19], we define the tail empirical distribution function and 
the tail empirical process, respectively, as 

1 n 

T n (s) = ■=- r / l{Yj>u n +u„s} j 

nF{u n ) ^ 

and 

e n (s) = T n (s) - r n (s) , s G [0, 00) . (5) 

From [19] we conclude that under appropriate mixing and other conditions on 
a stationary sequence Y iy i > 1, the tail empirical process converges weakly 
and the limiting covariance is affected by dependence. In our case, the results 
[19] do not seem applicable. In fact, it will be shown that we have two differ- 
ent modes of convergence. If u n is large, then ^ nF(u n ) is the proper scaling 
factor and the limiting process is Gaussian with the same covariance struc- 
ture as in case of i.i.d. random variables Yj. Otherwise, if u n is small, then 
the limit is affected by long memory of the Gaussian sequence. The scaling 
is different and the limit may be non-normal. These results are presented in 
Section 2.1. Note that a similar dichotomous phenomenon was observed in 
the context of sums of extreme values associated with long memory moving 
averages, see [16] for more details. On the other hand, this dichotomous 
behaviour is in contrast with the convergence of point processes based on 
stochastic volatility models with regularly varying innovations, where (long 
range) dependence does not affect the limit (See [6]). 

The process e n (-) is unobservable in practice, since the parameter u n de- 
pends on the unknown distribution F . Also, u n being large or small depends 
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on a delicate balance between the tail index a and the Hurst parameter H. 
In order to overcome this, we consider as in [19] a process with random levels. 
There, we set k = nF{u n ) and replace the deterministic level u n by Y n _k:n, 
where Y n:n > Y n -i :n > ■ ■ ■ > Y\, n are the increasing order statistics of the 
sample Y%, . . . , Y n . The number k can be thought as the number of extremes 
used in a construction of the tail empirical process. It turns out that if the 
number of extremes is small (which corresponds to a large u n above), then 
the limiting process changes as compared to the one associated with e n (-), 
but the speed of convergence remains the same. This has been already no- 
ticed in [19] in the weakly dependent case. On the other hand, if k is large, 
then the scaling from e n (-) is no longer correct (see Corollary 2.5). In fact, 
the process with random levels has a faster rate of convergence and we claim 
in Theorem 2.6 that the rate of convergence and the limiting process are not 
affected at all by long memory, provided that a technical second order regular 
variation condition is fulfilled. The reader is referred to Section 2.2. On the 
other hand, it should be pointed out that our results are for the long memory 
stochastic volatility models. It is not clear for us whether such phenomena 
will be valid for example for subordinated long memory Gaussian sequences 
with infinite variance. 

The results for the tail empirical process e n (-) allow us to obtain asymp- 
totic normality and non-normality of intermediate quantiles, as described in 
Corollary 2.4. On the other hand, the tail empirical process with random 
levels allows the study of the Hill estimator of the tail index a (Section 2.3). 
Consequently, as shown in Corollary 2.7, long memory does not have influ- 
ence on its asymptotic behaviour. These theoretical observations are justified 
by simulations in Section 3. 

Last but not least, we have some contribution to the theory of regular 
variation. To establish our results in the random level case, we need to work 
under a second order regular variation condition. Consequently, one has to 
establish in a Breiman's-type lemma that such a condition is transferable 
from Fz to F. This is done in Section 2.4. 
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2 Results 



2.1 Tail empirical process 



Let us define a function G n on (—00, 00) x [0, 00) by 



G n (x, s) 



P(<r(x)Zi > (1 + s)u n ) 



(6) 



By Breiman's Lemma and the regular variation of Fz, we conclude that 
for each s G [0,1], this function converges pointwise to T(s)G(x), where 
G(x) = cr a (x). A stronger convergence can actually be proved (see Section 
4.6 for a proof). 

Lemma 2.1. If (4) holds and E,[a a+e (X)] < 00 for some e > 0, then 



for all p such that pa < a + e. 

In order to introduce our assumptions, we need to define the Hermite 
rank of a function. Recall that the Hermite polynomials H m , m > 0, form an 
orthonormal basis of the set of functions h such that E[/i 2 (X)] < 00, where 
X denotes a generic standard Gaussian random variable (independent of all 
other random variables considered here), and have the following properties: 



where Sj^ is Kronecker's delta, equal to 1 if j = k and zero otherwise. Then 
h can be expanded as 



with c m = K[h(X)H m (X)] and the series is convergent in the mean square. 
The smallest index m > 1 such that c m 7^ is called the Hermite rank of h. 
Note that with this definition, the Hermite rank is always at least equal to 
one and the Hermite rank of a function h is the same as that of h — K[h(X)}. 



lim E sup \G n (X, s) - a a (X)T(s)\ p = 



(7) 



E[H m {X)] = , m > 1 , caviHjiX^HkiX)) = 5 j>k k\ 



00 




5 



Let J n (m, s) denote the Hermite coefficients of the function x —> G n (x, s). 
Since E[|if m (X L )| r ] < oo for all r > 1, Lemma 2.1 implies that the Hermite 
coefficients J n (jn, s) converge to J(m)T(s), where J(m) is the m-th Hermite 
coefficient of G, uniformly with respect to s > 0. This implies that for 
large n, the Hermite rank of (?„(•, s) is not bigger than the Hermite rank of 
G. In order to simplify the proof of our results, we will use the following 
assumption, which is not very restrictive. 

Assumption (H) Denote by J n (m,s), m > 1, the Hermite coefficients of 
G n (-, s) and let q n (s) be the Hermite rank of G n (-, s). Define 

q n = inf q n (s) , 

the Hermite rank of the class of functions {G n (-, s), s > 0}. In other words, 
the number q n is the smallest m such that J n (m,s) ^ for at least one s. 
Furthermore, let q be the Hermite rank of G. We assume that q n = q for n 
large enough. 

Remark. Since for large enough n it holds that q n (s) < q for all s, the 
assumption is fulfilled, for example, when G has Hermite rank 1 (as is the 
case for the function x — > e x ), or if the function a is even with the Hermite 
rank 2. 

The result for the general tail empirical process is as follows. 

Theorem 2.2. Assume (H) with q(l - H) ^ 1/2, (1), (4), nF(u n ) oo 
and that there exists e > such that 

< E[<r 2Q+e (X!)] < oo . (8) 

(i) If nF(u n )p1 y as n y oo or if {X,} is weakly dependent, then 
^nF{u n ) e n converges weakly in D([0, oo)) to the Gaussian process Wo 
T , where W is the standard Brownian motion. 

(ii) IfnF(u n )p1 — > oo as n — > oo then pn 9 ^ 2 e n converges weakly in D([0, oo)) 
to the process (E[cr Q (X 1 )])~ 1 J(q)TL g , where the random variable L q is 
defined in (29). 
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Remarks 

- We rule out the borderline case q(l — H) = 1/2 for the sake of brevity and 
simplicity of exposition. It can be easily shown that if q(l — H) = 1/2, then 
\JnF{u n )e n converges to W o T provided 1/F(u n ) tends to infinity faster 
than a certain slowly varying function (e.g. if u n = n 7 for some 7 > 0), 
even though it may hold in this case that np q n — > 00. The reason is that the 
variance of the partial sums of G(Xk) is of order n times a slowly varying 
function which dominates £q{u). 

- Here D([0,oo) is endowed with Skorohod's J\ topology, and tightness is 
checked by applying [2, Theorem 15.6]. Since the limiting processes have 
almost surely continuous paths, this convergence implies uniform conver- 
gence on compact sets of [0, 00). See also [21]. 

- The meaning of the above result is that for u n large, long memory does 
not play any role. However, if u n is small, long memory comes into play 
and the limit is degenerate. Furthermore, in the case of Theorem 2.2, 
small and large depend on the relative behaviour of the tail of Y\ and 
the memory parameter. Note that the condition nF(u n )p1 — > 00 implies 
that 1 — 2g(l — H) > 0, in which case the partial sums of the subordinate 
process {G(Xi)} weakly converge to the Hermite process of order q (see 
Section 4.1). The cases (i) and (ii) will be referred to as the limits in the 
i.i.d. zone and in the LRD zone, respectively. 

- Condition E[cr Q+e (Xi)] < 00 is standard when one deals with regularly 
varying tails. However, we need the condition E[<t 2q+<e (Xi)] < 00 in or- 
der to obtain the limiting distributions in the i.i.d. and LRD zones. See 
section 4.3.2. 

- The result should be extendable to general, not necessary Gaussian, long 
memory linear sequences. Instead of the limit theorems and covariance 
bounds of Section 4.1, one can use limit theorems from [15], and the co- 
variance bounds of [14, Lemma 3]. 

- Rootzen [19] obtained asymptotic the behaviour of the tail empirical pro- 
cess of a general stationary sequence {3^} under, in particular, the following 
conditions (see [19, Section 4]): 

• l n = o(r n ), r n = o(n); 
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(CI) E[\N n (x,y)\ p \N n (x,y) ^ 0] < oo, where p > 2 and N n is the point 
process of exceedances; 

(C2) (3 n (l n )n/r n — > 0, where (3 n (-) is the /3-mixing coefficient w.r.t. sigma 
field generated by the random variables 3^1{;y >«„}; 

(C3) 

cov 1 {3%>«n(i+«)}> ^^(l+t)} -> 

for some function r(x, y). 

Assume that r n — > oo, r n = o{n). For the sequence {Yj} under considera- 
tion here, it can be computed (see Section 4.3.1) 



1 

-cov 



r n F(u n ) 



.1=1 j=l 

T(s)T(t)J 2 (g)r n F(n n )p^ 



T(sVt) + 



E2[t7«(X 1 )]g!(l-2g(l- J ff)) ' 



Now, using (1), r n F(u n )pl n ~ F(u n )rn 2q ^ H \ Since r n = o(n), then the 
second part converges under the condition nF(u n )p^ — > 0. Consequently, 
Case (i) guarantees that the condition (C3) is fulfilled. As for the mixing 
property (C2), it is usually established by proving the standard /3-mixing, 
i.e. the one defined in terms of random variables Yj, not Yjlry >u n \- Now, if 
{Xj} is (3- mixing (in the latter sense) with rate (3 n , then the same holds for 
{Yj}. In our case, the sequence {Xj} has long memory, and thus it cannot 
be /3-mixing. Therefore, it is very doubtful that (C2) can be verified. 

Note also that in the case YlJLi l cov (Ao, Xj)\ < oo, which we refer to as 
the short memory case, the conclusion of part (i) of Theorem holds without 
any additional (mixing) assumption on the Gaussian process {Xj}. 

Moreover, results in the LRD zone cannot be obtain by applying Rootzen's 
or any other results for weakly dependent sequences. 



2.2 Random levels 

Similarly to [19], we consider the case of random levels. Let =>- denote weak 
convergence in D([0,oo)). Define the increasing function U on [l,oo) by 
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U(t) = F^(l — 1/t), where is the left-continuous inverse of F. Let k 
denote a sequence of integers depending on n, where the dependence in n is 
omitted from the notation as customary, and such that 

lim k = lim n/k = oo . (9) 

Such a sequence is usually called an intermediate sequence. Define u n = 
U(n/k). If F is continuous, then nF(u n ) = k, otherwise, since F is regularly 
varying, it holds that lim^oo k~ l nF(u n ) = 1. Thus, we will assume without 
loss of generality that k = nF{u n ) holds. Then the statements of Theorem 
2.2 may be written respectively as 

Vk(f n -T n ) WoT, (10) 

-i/ 2 (f - T ) =*> T-L (11) 

Let us rewrite the statements of (10), (11) as 

w n (T n -T n )=>w , 

where 

w n = Vk if lim kp q n = , (12) 

n— ¥co 

w n = p~ q/2 if lim kp q n = oo , (13) 

n— >oo 

and w = W o T if (12) holds (i.i.d. zone) and w = (E[a a (X 1 )])~ 1 J(q)TL q 
if (13) holds (LRD zone). 

We now want to center the tail empirical process at T instead of T n . To 
this aim, we introduce an unprimitive second order condition. 



lim w n \\T n - T||oo = , (14) 

n— >oo 

where 

(ct(X)Z > u n t) 



\T n - Tlloo = sup 



r 



The following result is a straightforward corollary of Theorem 2.2. 
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Corollary 2.3. Under the assumptions of Theorem 2.2, if moreover (14) 
holds, then w n (T n — T) converges weakly in D([0, 00)) to the process w. 

Let Y n: i < • ■ ■ < Y n:n be the increasing order statistics of Y±, . . . , Y n . The 
former result and Verwaat's Lemma [18, Proposition 3.3] yield the conver- 
gence of the intermediate quantiles. 

Corollary 2.4. Under the assumptions of Corollary 2.3, w n (Y n . n ^ — u n )/u n 
converges weakly to 710(1). 

Define 

1 n 

In this section we consider the practical process 

e* n (s)=f n (s)-T(s) J se[0,oo). 

For the process e*(-), the previous results yield the following corollary. 

Corollary 2.5. Assume (H), (1), (4), (8) and (14). Then w n e* n converges 
weakly in D([0, 00)) to w — T ■ w(0), i.e. 

• Iflimn^ookp^ = or {Xj} is weakly dependent, then 

Vke* n ^BoT (15) 
where B is the Brownian bridge. 

• If\\va. n ^ f00 kp'^ — > 00, then 

Pn q/2 e n =► . 

The convergence of w n (T n — T) to w — T-w(0) is standard. The surprising 
result is that in the LRD zone the limiting process is 0, because the limiting 
process of w n (T n — T n ) has a degenerate form, i.e. the limit is the random L q , 
multiplied by the deterministic function T(-). In fact, as we will see below, 
there is no dichotomy for the process with random levels, and the rate of 
convergence of e* is the same as in the i.i.d. case. 

To proceed, we need to introduce a more precise second order conditions 
on the distribution function Fz of Z. Several types of second order assump- 
tions have been proposed in the literature. We follow here [8]. 
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Assumption (SO) There exists a bounded non increasing function 77* on 
[0,oo), regularly varying at infinity with index —a(3 for some > 0, and 
such that lim^oo t]*(t) = and there exists a measurable function rj such 
that for z > 0, 

F(Z > z) = cz~ a exp [ ^ ds , (16) 

Ji s 

3C>0, Vs>0, \t]{s)\ < Cr]*{s) . (17) 

If (16) and (17) hold, we will say that Fz is second order regularly varying 
with index —a and rate function 77*, in shorthand Fz G 2RV{— a, 77*). 

Theorem 2.6. Assume (H), (1), (4), (SO) wife rate function rj* regularly 
varying at infinity with index —a(5 and there exists e > such that 

0<E[ ( t 2q(/3+1)+£ (X 1 )] <oo. (18) 

// 

lim \/krf{U{n/k)) = , (19) 

then y/k&n converges weakly in D([0, 00)) to B oT, where B is the Brownian 
bridge (regardless of the behaviour ofkp q n ). 

Remark. The additional moment condition (18) ensures that the distribution 
of Y satisfies a second order condition. See Section 2.4 for more details. It 
is also used in a proof of tightness argument (see (55) below). 

The behaviour described in Theorem 2.6 is quite unexpected, since the 
process with estimated levels Y n _k :n has a faster rate of convergence than the 
one with the deterministic levels u n . A similar phenomenon was observed 
in the context of LRD based empirical processes with estimated parameters. 
We refer to [17] for more details. 



2.3 Tail index estimation 

A natural application of the asymptotic result for the tail empirical process 
e* is the asymptotic normality of the Hill estimator of the extreme value 
index 7 defined by 
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Since 7 = / °°(1 + s) 1 T(s) ds, we have 

Jo l + « 

Thus we can apply Theorem 2.6 to obtain the asymptotic distribution of the 
Hill estimator. 

Corollary 2.7. Under the assumptions of Theorem 2.6, vk(j n — 7) converges 
weakly to the centered Gaussian distribution with variance 7 2 . 

It is known that the above result gives the best possible rate of con- 
vergence for the Hill estimator (see [8]). The surprising result is that it is 
possible to achieve the i.i.d. rates regardless of H. 

2.4 Second order conditions 

Whereas the transfer of the tail index of Z to Y is well known, the transfer 
of the second order property seems to have been less investigated. We state 
this in the next proposition, as well as the rate of convergence of T n to T and 
G n to G x T. 

Proposition 2.8. If Fz € 2RV(— a, 77*), where 77* is regularly varying at 
infinity with index —a(3, for some (3 > 0, and if 

E[a a(/m)+£ pO] < 00 , (20) 

for some e > 0, then F G 2RV(—a, rf), and 

||T n -T|| 00 = 0(r7*( Mn )) . (21) 

Moreover, for any p > 1 such that pa((3 + 1) < a(/3 + 1) + e, 

E[mp s > \G n (X,s) - a a (X)T(s)\i>] =0( V '(u n ) p ) . (22) 

Examples The most commonly used second order assumption is that rj* (s) = 
0(s~ a °) for some /3 > 0. Then 

F z {x) = cx~ a (l + O(x~ a0 )) as x -> 00 , (23) 
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for some constant c > 0. Then, \\T n — = 0((k/n)P), and the second 
order condition (14) becomes 

fk\ w 

lim k - = , if lim kp q n = (24) 



n J 



and 

'k\ 2 ? 



n— ¥oo 



lim p- q - = if lim kp q n = oo . (25) 

n— >oo \ jt, y n—s-oo 

Condition (24) holds if both fc < n (2/J)/(2/?+i) and fc < n 2(i-H) The central 
limit theorem with rate -y/fc holds if k x n 7 with 

7 < 2(1- if) V 



2(3 + 1 

Condition (25) holds if n 2 ^~ H > <C k <C n 1 "^ 1 " 11 ^ 13 . This may happen only if 

' 2H-1 

or equivalently 

i> ff > 1 + <' 5 



2/3 + 1 

As /? — y 0, only for very long memory processes (i.e. if close to 1) will the 
LRD zone be possible. 

The extreme case is the case /3 = 0, i.e. t]* slowly varying. For instance, 
if r?* (x) = l/log(x) (for x large), then the tail F(x) = x _a log(a;) belongs to 
2RV(-a,r]*) and U(t) ~ {tlog(t)/a}~ 1/a . The second order condition (14) 
holds if 

fc 1/2 log _1 (n) -> . 

If this condition holds, then kp\ — > for any if > 1/2 and the LRD zone 
never arises, because the LRD term in the decomposition (33) is always 
dominated by the bias. 
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3 Numerical results 



We conducted some simulation experiments to illustrate our results. We used 
R functions HillMSEO and HillPlot available on the authors webpages. 

Our first experiment deals with the Mean Squared Error. 

1. Using R-fracdiff package we simulated fractional Gaussian noises 
sequences {X^d)} with parameters d = 0,0.2,0.4,0.45. Here, d = 
H — 1/2, so that d = corresponds to the case of an i.i.d. sequence. 

2. We simulated n = 1000 i.i.d. Pareto random variables Zi with param- 
eters a = 1 and 2. 

3. We set Y t (d) = exppf^d))^. 

4. Hill estimator was constructed for different number of extremes. 

5. This procedure was repeated 10000 times. 

6. The results are displayed on Figure 1, for a = 1 and a = 2, respectively. 
On each plot, we visualise Mean Square Error (with the true centering) 
w.r.t. the number of extremes. Solid lines represent different LRD 
parameters: black for d — 0, blue for d = 0.2, red for d = 0.4 and green 
for 0.45. 

We note that for a = 1, when a small number of extreme order statistics k 
is used to build the Hill estimator, there is not much influence of the LRD 
parameter, and in particular the MSE is minimal for more or less the same 
values of k through all the range of values of d. This is in accordance with our 
theoretical results. For a = 2, the influence of the memory parameter is more 
significant. These two features can be interpreted. First, it seems natural 
that the long memory effect appears when a greater number of extreme order 
statistics is used, since our result is of an asymptotic nature. For a small 
number of extremes the i.i.d. type of behaviour dominates (see R n (-) in (33)), 
so the asymptotic result is seen; for a larger number of extremes, the long 
memory term S n in (33) starts to dominate. For an extremely large number 
of order statistics (i.e. k x n), the bias dominates. The influence of a on the 
quality of the estimation is twofold. On one hand, the asymptotic variance 
of the Hill estimator is a 2 , so that the MSE increases with a. Also, for very 
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Figure 1: MSE: a = 1 (left panel), a = 2 (right panel); color codes: black - d = 0, 
blue - d = 0.2, red - d = 0.4, green - 0.45 



small values of a, the peaks observed are extremely high and completely 
overshadow the effect of long memory. 

Next, we show Hill plots for several models, since in practice one usually 
deals with just a single realization. 

1. We consider the model Yi = exp(rXj)Z;, where {X{\ is as above a 
fractional Gaussian noise and r = 0.05 or 2. 

2. We simulated n = 1000 i.i.d. Pareto random variables Zi with param- 
eter a = 2. 

3. We simulated fractional Gaussian noise sequences {Xi} with parame- 
ters d = (i.i.d. case), 0.2, 0.4, 0.45. 

4. The estimators are plotted on Figures 2 and 3. The left panel corre- 
sponds to the Hill estimator for iid Pareto random variables {Z^}, and 
the right one for the long memory stochastic volatility process {Yj}. Re- 
call that the Yi are dependent asympotically Pareto random variables, 
so that there are two sources of bias for the Hill estimator. 

We may observe that for a small volatility parameter r there is not too 
much difference between the two plots. However, if r becomes bigger, the 



15 



200 400 GOO 800 1 ooo 
Number of extremes 



O 200 400 600 SOO 1 OOO 

Number of extremes 



Figure 2: Hill estimator: a = 2 and Pareto iid (left panel), r = 0.05 (right panel) 
color codes: black - d = 0, blue - d = 0.2, red - d = 0.4, green - 0.45 
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Figure 3: Hill estimator: a = 2 and Pareto iid (left panel), r = 1 (right panel) 
color codes: black - d = 0, blue - d = 0.2, red - d = 0.4, green - 0.45 
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estimation with a large number of extremes is completely inappropriate if 
d > 0, though without much influence of the strength of the dependence 
(i.e. increase of d) on this degradation. The reason is that the second order 
condition satisfied by the stochastic volatility model yields the same rate 
of convergence as in the i.i.d. case, but an increase in the variance of the 
Gaussian process {X t } entails a bigger bias in finite sample. 



4 Proofs 



4.1 Gaussian long memory sequences 

Recall that each function G(-) in L 2 (d/i), with p(dx) = (2tt)~ 1 ^ 2 exp(— x 2 /2) dx 
can be expanded as 

J{m) 



G{X) = E[G(X)\ + V J -^±H m {X) 



mi 

m=l 

where J(m) = K[G(X) H m (X)] and X is a standard Gaussian random vari- 
able. Recall also that the smallest q > 1 such that J(q) ^ is called the 
Hermite rank of G. We have 

E[G(X )G(X k )} = E[G(Xo)\ + £ p% , (26) 

m=q 

where p k = cov(X , X k ). Thus, the asymptotic behaviour of K[G(Xo)G(Xk)} 
is determined by the leading term p q n . In particular, if 1 — q(l — H) > 1/2, 
which implies that n 2 p^ — > oo, 



var 



and 



n 

^G(X,-)4j(g)L 3 , (28) 



q/2 _ 

npU j=1 
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where 

L q = (q\(l -2q(l - H))' 1 ' 2 Z H , q {l) (29) 

and Zu,q is the so-called Hermite or Rosenblatt process of order q, defined 
as a g-fold stochastic integral 



roc roc it( X1 +-+x q ) _ J 9 

z H , q (t) = ... — - — - — n x i ■ ■ ■ w ^> 

J — oo J— oo 3^1 + " " " + _. =1 



where W is an independently scattered Gaussian random measure with Lebesgue 
control measure. For more details, the reader is referred to [20]. On the other 
hand, if 1 — q(l — H) < 1/2 or {X,} is weakly dependent, then 



1 " 

-=Y,G{X j )AN%T$, (30) 



n 

3=1 



where Eg = var(G(X )) + 2 £~ i cov(G(X ), G{Xj)) < oo. 
We will also need the following variance inequalities of [1]: 

• If 1 — q(l — H) > 1/2, then for any function G with Hermite rank q , 

var ^n" 1 G ( X j)j < C Pl var(G(Xi)) . (31) 

• If 1 — q(l — H) < 1/2, then for any function G with Hermite rank q , 

var ^n~ l G ( X j) j < var(G r (X 1 )) . (32) 

In all these cases, the constant C depends only on the Gaussian process {Xj} 
and not on the function G. The bounds (31) and (32) are Equation 3.10 
and 2.40 in [1], respectively. 
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4.2 Decomposition of the tail empirical process 



The main ingredient of the proof of our results will be the following decom- 
position. Let X be the a- field generated by the Gaussian process {X n }. 



nF(u n/ 



1 n 



1 

+ ^ {P(1S ' > (1 + S)Mn|A,) " P(Un) } 



n> 3=1 



=: R n {s) + 5 n (s) . (33) 

Conditionally on X, R n is the sum of independent random variables, so it 
will be referred to as the i.i.d. part; the term S n is the partial sum process of 
a subordinated Gaussian process, so it will be referred to as the LRD part. 



4.3 Proof of Theorem 2.2 

We first give a heuristic behind the dichotomous behaviour in Theorem 2.2. 
Then, we prove convergence of the finite dimensional distributions of the i.i.d. 
and LRD parts. Finally, we prove tightness and asymptotic independence. 

4.3.1 Heuristic 

To present some heuristic, let us compute covariance of the tail empirical 
process. We have 

COv(f n (s),f n (t)) = n pl^ r COv(l { y 1>Mn( i +s)} , l{ Yl >u„(l+t)}) 

2 n_1 

+ n 2_p2( Un ) ~ j) C V ( 1 {Kl>«n(l+.)} ) 1{Y J+1 >M1+*)} ' 
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Recall (3). If E[cr Q+e (X 1 )] < oo holds, we apply Breiman's Lemma to both 
nominator and denominator to get 

j. COv(l{y 1>Mn (i +5 )}, l{y 1>M „(i+t)}) 
n->oo F{u n ) 

!■ E[q"(X 1 )]P(Z 1 > n n (l + g) V u n (l + t)) _ 
-i™ E[a-(X 1 )]P(Z 1 > u n ) -i{svt). 

Furthermore, if K[a a+e (Xi)a a+e (Xj + i)} < oo holds (which is guaranteed by 
(8)), then a generalization of Breiman's Lemma yields 

COv(l { y 1>M „(i + s)}, l{Y J+1 >u n (l+t)}) 
rwoo F 2 (ll n ) 

Urn P(^>^ + _s) Y > Un[1 + t)) 
F 2 {u n ) 



71— >OC 



T(a)T(t) 



E[ < 7«(X 1 )]E[ < 7«(X i+1 )] 
Therefore, for fixed s and t, using (27) in the case q(l — H) < 1/2, we obtain 
cov (f n (s),f n (t)) 

nF(u n ) 



T(sWt) T{s)T{t)J\q)pl 



nF{u n ) ql(l - 2q(l - H^E^X,)} 



In particular, setting s — t, then we conclude that the normalization factor 
for e n (-) should be \J nF(u n ) or p n q ^ depending whether nF(u n )p1 — > or 
nF(w n )p^ — > oo holds. The asymptotic variance also suggests the form of 
limiting distributions in Theorem 2.2. 



4.3.2 Finite dimensional limits 

Let A denote weak convergence of finite dimensional distributions. It will 
be shown in Section 4.3.2 and 4.3.2, respectively, that for each m > 1 and 
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Si G [0, oo), / = 1, . . . , M, si < ■ ■ ■ < s M , 



'nF{u n ) {Rn(si), R n ( Sl ) - R n ( Sl ^), 1 = 2,..., M) 

A(JV(o J r( Sl )),jV(o J r(5,)-r(s,_ 1 )),z = 2,... J M) , (34) 

where the normal random variables are independent, and 

p^ 2 (S n ( Sl ), S n (s M )) 4 E[ ^ ] Xi)] (nsi)i • • • , T{s M ))L q , (35) 

if 1 - q(l - H) > 1/2. On the other hand, if 1 - q(l - H) < 1/2, then the 
second term S n (-) is of smaller order than the first one, Rn(-). 

The i.i.d. limit 



Define 



Then 



L n ,j{x, S) = l{a(x)Z J >(l+s)u n } ~ F(a(x)Z 1 > (1 + s)u n ) 



R n(s) = ^2L n j(Xj,s) . 
3=1 

r(m). 



Set L n>j (x) = L nJ (x, 0) and V^ m> (x) = E[L™-(x)]. Note that E[K (1) (X,-)] = 
and 

V^\x) = ¥(a(x)Z 1 > u n ) - F 2 (a(x)Z 1 > u n ) . 
Let R n := R n (0). Therefore, for fixed t, 

logE \^V n ^) R ^\X 

it 



^logE 



cxp 



^logE 

-t 2 
2nF(u r 



y/nF(u n ) 
it 



{l {Y]>Un} -¥(Y 3 >u n \X)}\ | X 



1 " v^m LnAXj) ~ + Ln ' j(Xj)0 [wmk 2 ) 1 * 



3=1 

(36) 
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We will show that 



(37) 



given that E[cr a+<5 (Xi)] < oo. This also shows that the second term in (36) 
is negligible. Furthermore, since for sufficiently large n and 5 > (cf. (59)), 

\V^ 3) (x)\ < CF(a(x)Z l > u n ) < C{a{x) V l) a+s P(Z 1 > u n ) , 

the expected value of the last term in (36) is 

Consequently, the last term in (36) converges to in L 1 and in probability. 
Therefore, on account of (37) and the negligibility, we obtain, 



logE 



jty/ nF{u n )R n 



X 



A -t 2 /2 



(38) 



and from bounded convergence theorem we conclude (34) (for M = 1 and 
s = 0). It remains to prove (37). By Lemma 2.1, for each j > 1, G n (Xj,s) 
converges in probability and in L 1 to a a (Xj). Therefore, 



lim E 



1 ¥(a{X j )Z 1 > u n | X) 
n ^ 

3=1 



Next, since a a (Xj), j > 1, is ergodic, we have 

1 n 

71 < ■* 



a a {X 3 ) 



(39) 



(40) 



3=1 



Thus, (39), (40) and Breiman's Lemma yields 

1 n 



(41) 



Write now 



nF(u n 



n n 
^E^i) = 1 + 0P(1) + E P2 (^)^ > U n I *) • 



3=1 
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By Lemma 2.1, we have, for some 8 > small enough, 



i " 

CF{Z > «- Y^W X i) v l ? a+& ^ • ( 42 ) 



< 

n 

This proves (37) and (34) follows with M = 1 and si = 0. The case of a 
general M > 1 is obtained analogously. 

Long memory limit 

Recall the definition (6) of G n (-, s) and that G(x) = a a (x). Define 

J n (m, s) = E[H m (X 1 )G n {X 1 , s)}, J(m) = E[H m (X 1 )G(X 1 )}, 

the Hermite coefficients of G n (-,s) and G(-), respectively. Let q be the Her- 
mite rank of G(-). We write (recall Assumption (H)), 



J2(Gn(X j ,s)-E[G n (X j ,s)}) 



- 1 1 TJ ^w + 1 1 JJm ' 3) ;f w J(m) g- w) 

j'=l m=q j'=l m=g 

=: T( S )^ + 5 n ( S ), (43) 

with S 1 * = 5^" =1 G(Xj). On account of Rozanov's equality (26), we have that 
the variance of the second term is 



var(£ n ( S )) = ££ WBl ' Vr J l ^^ ; cov-(X t ,X,) 

i,j=l m=q 

{J n {m 1 s)-T{s)J{m)f 



m! 

ij=l m=g 

= ||G„(-,a) -r( a )G(.)||i» ( ^ £ Icav^jgi 
<C7n 2 ^||G? B (., a )-T( a )G?(.)||i» w . (44) 
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Since E[<t 2q!+5 (X)] < oo, by Lemma 2.1, G n (-,s) converges T(s)G(-) in 
L 2 (dp), uniformly with respect to s. We conclude that the second term 

on the right handside of (43) is op (npn 2 ^, i.e. it is asymptotically smaller 

than the first term. Furthermore, 

Sn(s) = P{ %> «) " n.Gn{X^ a)]) , (45) 

nr [U n ) . =1 



so that via (28) and (60) 



Pn q/ sm -+ ' (46) 



if 1 — q(l — H) > 1/2. Consequently, (35) holds for M — 1. The multivariate 
case follows immediately. On the other hand, if 1 — q(l — H) < 1/2, then via 
(30) and (60), 

sup S n (s) 4 J V(0,Eg) , 

se[o,i] Jii[(T Q (Ai)J 

which proves negligibility with respect to the term R n {-)- 



4.3.3 Asymptotic independence 



In this section we prove asymptotic independence of R n (-) and S n (-). We 
will carry out a proof for the joint characteristic function of (R n , S n ) = 
(i? n (0), S n (0)). Extension to multivariate case is straightforward. On account 
of (38), (46) and the bounded convergence theorem, we have 



E 



exp ^is^nF(u n )R n + itp n q/2 S n \ 



E 



E[exp{is^/nF(u n )R n } | X] exp {itp- q/2 S n ) 



exp (- s y2)ijj L , 



t as n — > oo , 



where ipL q (-) is the characteristic function of L q . This proves asymptotic 
independence. 
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4.3.4 Tightness 

In order to prove the tightness in X>([0, oo)) endowed with Skorokhod's J\ 
topology of the sequence of processes R' n := ^/nF(u n )R n , we apply the 
tightness criterion of [2, Theorem 15.4]. We must prove that for each A > 
and e > 0, 

lim lim sup F(w'{(R' n , 5) > e) = , (47) 

where for any function g G T>([0, oo)), 

w a(9 ,S) = sup \g(s)-g(t 1 )\A\g(t 2 )-g(s)\. 

0<ti<s<t 2 <A 

Since the Y^s are independent conditionally on X, by elementary computa- 
tions similar to those that lead to [2, Inequality 13.17], we obtain that 

E - K(ti)\ 2 \K(t 2 ) - R' n (s)\ 2 | X] < 3{Q n ( tl ) - Q n {t 2 )f , (48) 

where 

1 " 

Q n (s) = _^ Yl ^M 1 + *)M X i)) ■ 

Note that Q n (s) converges in probability to T(s) which is a continuous de- 
creasing function on [0, oo). Let m > 1 be an integer and set 5 = A/2m. 
Applying [2, Theorem 12.5] and using the same arguments as in the proof 
of [2, Theorem 15.6] (p. 129, Eq. (15.26); note that the assumed continuity 
of the function F that appears therein is not used to obtain (15.26)), we see 
that the bound (48) yields, for some constant C (whose numerical value may 
change upon each appearance), 

2m- 1 

P«(K, S) > e | X) < Ce~ A £ {Qn(k8) - Q n ((k + 2)5)} 2 

k=0 

< Ce- 4 Q„(0) max {Q n (kS) - Q n ((k + 2)5)} . 

0<fc<2m— 1 

Letting now n — > oo yields 
limsupP(w^«,5) >e\X) 



< Ce" 4 max {T(k5) - T((k + 2)5)} < Ce~*5 

0<k<2m— 1 



4 raAl 
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By bounded convergence, this yields 

limsupP(^«,5) > e) < Ce"V A1 , 

n— >oo 

and (47) follows. 

We prove now tightness of S n . Assume first 1 — q(l — H) > 1/2 and define 
S' n = p n q ^ 2 S n . Applying (31) there exists a constant C, which depends only 
on the Gaussian process {Xj}, such that we have, for s < t, 

vav{S' n {s) - S' n (t)) < Cvar(G n (X 1 , s) - G n (X u t)) 

<CE f P 2 K(i + s) < a(x 1 )z 1 < Un {i + t)\xy 

[ V(Z>u n ) 

Let the expectation in last term be denoted by Q' n (s,t). By the same adap- 
tation of the proof of [2, Theorem 15.6] as previously (see also [13] for a more 
general extension), we obtain, for each A > 0, and for 5 = A /2m for an 
integer m > 1, 

2m- 1 

F(w'i(S' n , 5)>e)< Ce~ 2 ]T Q' n (2k5, (2k + 2)5) . 

k=0 

Thus, letting n tend to infinity while keeping m fixed, we get 

2m- 1 

hmsupP«(^,5) >e) <Ce~ 2 ^ {(1 + 2/W)~ a - (1 + (2k + 2)5)~ a } 2 

2m- 1 



< Ce~ 2 5 2 (1 + 2M)- 2a ~ 2 < Ce~ 2 5 . 



k=0 



Thus lim^o hmsup^^ F(w'^(S' n , 5) > e) = and this concludes the proof 
of tightness. 



4.4 Proof of Corollary 2.5 and Theorem 2.6 

As in case of Theorem 2.2, we start some heuristic. Recall computation from 
Section 4.3.1 and the form of the limiting distribution w — T ■ w(0). Then 

vax(f n (s)) = (1 + o(l))var(T n (s) - T(s)f n (0)) 
= (1 + °( 1 ))^J T ( S )( 1 " T (*)) + oi^^Pl 
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This suggests that in LRD zone p n q T n (-) converges to 0. 

To prove it formally, denote T n = T n - T and £ n = Yn ~ k u n n ~ Un = T%~(1). 
Then T n (£ n ) = 1, and we have 

1 = en(£n) + T n (C n ) = e re (£ re ) + T re (£ n ) + T(Q . 

Thus, 

T(£ n ) - 1 = -e n (£ n ) - f n (Q . (49) 

For any s > 0, f n (s) = f n {s + £ n (l + a)) and T(s + £ n (l + s)) = T(s)T(£ n ), 
thus 

e;(s) = e n (s + £ n (l + s)) + f n (s + £ n (l + s)) + T{s + f n (l + s)) - T{s) 
= e n (s + e„(l + s)) + T(s){T(£ n ) - 1} + f„(s + £„(1 + s)) . 

Plugging (49) into this decomposition of e* , we get 

e*(s) = e n (s + £ n (l + s)) - T(s)e n (C n ) + T n (s + £ n (l + s)) - T(s)f n (£ n ) . 

(50) 

In order to prove Corollary 2.5, we write 

w nKi (s) = w n {e n (s + £ n (l + s)) - T(s)e n (£ n )} + 0(w„||T n - T^) . (51) 

Since the convergence in Theorem 2.2 is uniform, and by Corollary 2.4 £ n = 
op(l), the first term in (51) converges in D([0, oo)) to w — T ■ w(0). Under 
the second order condition (14), the second term is o(l). This concludes the 
proof of Theorem 2.5. 

We now prove Theorem 2.6. In order to study the second-order asymp- 
totics of w n e* n (s), we need precise expansion for e n (s+£ n (l+s)) and e n (£). For 
this we will use the expansions of the tail empirical process in Section 4.3.2. 
Since F{u n ) = k/n, using (33), (43) and (45), we have 

e n (s) = R n (s) + ^4 T(s)S: + ^-Ms) , (52) 
nF(u n ) nF{u n ) 
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which, noting again that T(s + £ n (l + s)) = T(s)T(£ n ), yields 
e n (s + e„(l + s)) - T{s)e n {i n ) = R n (s + £ n (l + s)) - T(s)i? n (^) 

+ ^4{^„( S + 4(1 + *)) - T( S )^ n (e„)} 
nF[u n ) 

and 

e;(s) = i?„(s + + s)) - T{s)R n {C n ) 

F 7 (u ) ~ ~ 
+ ^^{S n (sHn(l + s))-T(s)S n (Q}+T n (s+Ul + s))-T(s)T n (Q . 
nF(u n ) 

(53) 

Similarly to (44), and utilising Fz(u n ) / F{u n ) = 0(1), 



var 



S n (s) ) < C{pl V ^n-^WGni-, s) - T(s)G(-) ||£ 2M . 



nF(u n 

Using the second order Assumption (SO) through (22), we obtain 



var 



^T^S n (s) ) = O {{pi V ^(n^KW 2 ) = o (r/*K) 2 ) . (54) 
ni* (u„J 



Using (52) in the representation (50) and since Proposition 2.8 implies that 
\\T n - T||oo = 0{rf{u n )), we obtain: 

e* n {s) = R n (s + 4(1 + s)) - T(s)R n (£ n ) + P ( V *(u n )) . 

Since we have already proved that the convergence of \/kR n is uniform, we 
obtain that \fke* n converges in the sense of finite dimensional distribution to 
B o T, where B is the Brownian bridge, if the second order condition (19) 
holds. To prove tightness, we only have to prove that k 1 ^ 2 n~ 1 S n converges 
uniformly to zero on compact sets. For s > and i6R, denote G n (x, s) = 
G n (x, s) — T(s)G(x) and recall that we have shown in Section 4.3.4 that 

r2~ 2 var(S' n (s) - S n (s')) < C\\G n (-, s 2 ) - G n (-, si)|||a (4tt j . 

Applying (63), we get 

n- 2 vzr(S n (s) - S n (s')) < C( V *(u n )) 2 E [(a(x) V l)M/W)+e] ( s - s ') 2 , (55) 

which proves that k 1 ^ 2 n~ 1 S n converges uniformly to zero on compact sets. 
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4.5 Proof of Corollary 2.7 

Using the decomposition (53), and the identity / °°(1 + s) -1 T(s) ds = 7, we 
have 



7n - 7 



00 e*Js) 



1 + 3 



ds 



fl n (s + £ n (l + s)) 



1 + s 



ds - -iR n {i n ) 



+ 



+ 



nF(u n ) 



1 + s 



ds — 7 



nF(u r 



T n (s + £„(1 + s)) 



1 + s 



ds - 7T n (^ n ) . 



(56) 
(57) 



We must prove that the terms in (56) and (57) are Op(r)*(u n )) and that 

^f(l + .)"'*.(. + 6.(1 + «)) d. A f EJiZM ds = 7 /' E« d( . 

7o io 1 + s Jo t 

(58) 

To prove (58), we follow the lines of [18, Section 9.1.2]. We must prove that 
we can apply continuous mapping. To do this, it suffices to establish that for 
any S > we have 



lim lim sup A nM = 



where 



^ lY, (Uy^uus} - P (Yj > u n s\X)) 

.7=1 



ds 



> 5 



By Markov's inequality, conditional independence and Potter's bound [3, 
Theorem 1.5.6] , we have, for some e > 0, 



A n , M < C-j= 



ds<C 



nF{unj 



k 



s -l-a/2+e dg < CM ~ a /2+e _^ Q 



M 



as M — y 00, since = nF{u n ). This proves (58). To get a bound for (57), 
we use (61) which yields, for all t > 0, 



|T n (t)| < cv( Mn )(i + *) 
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Thus T n (£ n ) = P ( V *(u n )) and |f n (s+£„(l+s))| < Crf ( Ure )(l+ s )-^+ e (l + 
£ n )~ a , thus 

r n ( s + e„(i + s))i 

— ds = O p (?7 (tt n )) . 



o 



1 + 8 

We finally bound (56). 



o 1 + s i 6l 1 + w 

Since £ n = op(l), we can write 

p | fcl/2 ^ du>€ ^< m >1)+F Li* du > e 

fcl/2 roc E l/2rc-2/ \1 



ne Ji 1 + s 
Applying (44) and (70) yields 

n ^ E i V2[g " (s)] ds < Cpfr,*(u n ) [°° s -(^D/2+ e -i ds = 0j ,(a--i/2) . 
A 1 + s Jo 

Thus the first term in (56) is op(k~ 1 ^ 2 ), and so is the second term since 
k 1 / 2 n~ 1 S n converges uniformly to zero on compact sets. This concludes the 
proof of Corollary 2.7. 



4.6 Second order regular variation 

The main tool in the study of the tail of the product YZ is the following 
bound. For any e > 0, there exists a constant C such that, for all y > 0, 

P if5 1>X ? <C{lVy^). (59) 
This bound is trivial if y < 1 and follows from Potter's bounds if y > 1. 
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Proof of Lemma 2.1. By Breiman's Lemma, we know that for any sequence 
u n such that u n — > oo, 

lim G n (x, s) = lim n^)Zi > (l + s)u n ) ) = + g) _ a = ffa(x)r(a) 

n-j-oo n->oo F(Z > U n J 

(60) 

If E[cr Q+e (X)] < oo, then the bound (59) implies that the convergence (60) 
holds in L p (p) for any p such that pa < a + e, uniformly with respect to s, 
i.e. 

lim E[sup \G n (X, s) - a a (X)T(s)\ p ] = . 

□ 



Before proving Proposition 2.8, we need the following lemma which gives 
a non uniform rate of convergence. 

Lemma 4.1. If (4), (16) and (17) hold, if rf is regularly varying at infinity 
with index p, for some p < 0, then for any e > 0, there exists a constant C 
such that 



Vi > 1 , Vz > , 



F(Z > zt) 



[Z>t) 



< Cr]*(t)z- a+p (z Vz- l ) e . (61) 



Proof. Since 77* is decreasing, using the bound |e" — 1| < ue u+ with u + 
max(«, 0), we have, for all z > 0, 



> zt) 



P(Z > t) 





f 


= z~ a 


exp / 








fZVl 


< Cz~ 






JzAl 


< Cz~ 


a \og(z 



'' V{ts) 
s 

T]*(.St) 



ds - 1 
ds exp 



zvi 



Tj*{st) 



ds 



s 

2VI * 



7}*(st) 



ds 



< Cz- a (zAl) p - e/2 r]*(t) exp 



zAl S 
zVl 



T)*(st) 



' zAl S 

We now distinguish three cases. Recall that 77* is decreasing. 



ds . (62) 
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If z > 1, then z — >■ exp J* 2 s _1 r/*(s) ds is a slowly varying function 
by Karamata's representation Theorem, and is 0(z e / 2 ) for any e > 0. 
Plugging this bound into (62) yields (61). 



If z < 1 and tz > 1, then 
exp / as = exp 



^d S <exp r^ds 



for any e > by the same argument as above and this yields (61). 

If tz < 1, then t r < z~ r for any r > and t p ~ e = 0(i]*(t)) for any 
e > 0. Thus 



F(Z > zt) 
~^t) 



< 



1 



>t) 



-a < Ct a+e/2 + z ~a < Cz ~a-e/2 



< Cz- a+p -H p - e/2 < Cz- a+p - e 7]*(t) 



This concludes the proof of (61). 

The following bound is used in the proof of prove Theorem 2.6. 



□ 



Lemma 4.2. If (4), (16) and (17) hold, if rf is regularly varying at infinity 
with index p, for some p < 0, then there exists a constant C such that for all 
t > 1 and b > a > 0, 



<Z <bt) 



>t) 



< Cr]*(t)(a A iy a+p ~ e (b - a) . (63) 



Proof. The bound (63) follows from the following one and (59) applied to 
the function 77*. 



F(at <Z <bt) 



¥(Z > t) 



'a~ a - b- a ' 



< Cr]*((a A l)t)(a A l)""- 1 " 6 ^ - a) (64) 



Let I be the function slowly varying at infinity that appears in (4), defined 
on [0, 00) by £(t) = t a F(Z > t). Assumption (SO) implies that 



rt ds 
£(t) = £{1) exp / v {s) — 



(65) 
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where the function 77 is measurable and bounded. This implies that the 
function £ is the solution of the equation 



f\(s)£(s)- 



(66) 



Conversely, if £ satisfies (66) then (65) holds. We first prove the following 
useful bound. For any e > 0, there exists a constant C such that for any 
t > 1 and all a > 0, 



£(at) 



< Ca ±e 



(67) 



where we denote a ±e = max(a e ,a e ). Indeed, if at > 1, then, if being 
decreasing, we have 



£(at) 



< C exp 



aVl 



aAl 



rj*{ts) 



ds < C exp 



oV(l/a) * 



r]*(ts) 



ds < Ca 



±c 



since the latter function is slowly varying by Karamata's representation theo- 
rem. If at < 1, then £(at) < 1 and £~ l (t) = o(t e ) = o(a~ e ). This proves (67). 
Next, applying (66) and (67), for any e > and < a < b, we have 



£{bt) 



£(at) 



< Ca 



f b . , £(st) ds 



f , , £(st) ds 
A £{at) s 

< Crf{at) j s^" 1 ds <Crj*(at)a ±e -\b- a) 

J a 

Applying (67) and (68), we also obtain 

< Cr)*((aAl)t) a ±e . 



(68) 



£{at) _ 



For e > and < a < b, we have 

^/-i* -(<■-->-) = « 



(69) 



£(at) 
~W 

(a~ a - b- a ' 



g(ot) _ 



gag \ £{bt) 
£(t) \£(at) 
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which yields 



P(at < Z <bt) 



P(Z > t) 



< Cr]*{{a M)t)a a - l±t {b - a 



□ 



Proof of Proposition 2.8. Define the function a by o~(x) = cr(x) V 1. Apply- 
ing (61) with (1 + s)/a{x) instead of z and u n for t, we get 



\G n (x,s)-a a (x)T(s) 



F(a(x)Z > u n (l + s)) 



a Q (o;)T(s) 



< C V *(u n )a{x) a ^ +1)+e {l + s )~ a ^ +1)+e . (70) 



This implies, for all p such that E[a pa{p+l)+e (X)] < oo, that 



E 



svp\G n (X, s)-T(s)a a (X)\> 



S>1 



This proves (22) which in turn implies (21) since T n (s) = p^™^ K[G n (X, s)]. 

In order to prove that F Y E 2RV(-a,rj*), denote £(y) = y a F(Y > y). We 
will prove that there exists a measurable function fj such that (66) holds with 
£ and fj. Denote £ = cr(X). Applying (66) and using the independence of £ 
and Z, we have 



e(y)=E[Z a £(y/<T)]=e(l)E[t a ]+E 

= £(1)E[C] + E 
= E 



ds 

i s 



E 



f y ds 

E t Q f *(1) + f V(s)£(s)- 

f v ds f* ds 

e[^(i/oi+ / nev(s/ot(s/o}- = ^)+ / $00*00- 

Jl s Jl s 



r y ds 
C / v(s/0£(s/t)- 
Ji s . 

y ds 

ncv(s/o^/o]- 
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where we have defined 

E[£ a ri{s/Z)e{s/Z)] E[t" V {s/Z)e{s/Z)/£(s)] 



fj(s) 



E[Z»e(s/Q] E[^£(s/0/£(s)} 



The denominator of the last expression is bounded away from zero. Indeed, 
let e > be such that P(£ > e) > 0. Then 

neas/oms)] = T > ? > n< - t)nz > s/£) 



> s) ~ F(Z > s) 

Since Z has a regularly varying tail, it holds that inf s >oP(^ > s/e)/¥(Z > 
s) > 0. This proves our claim. Thus, applying (59) with the regularly varying 
function rj*, we get, for e > such that exp[^ a ~ p+t ] < oo, 

\fj(s)\<Cr,*(s)E[e{v%s/Oms)}{£(s/0/£(s)}}<^ . 

Thus i satisfies equation (66) with fj such that \r)\ < Crf, thus Y G 2RV(— a, t]*) 

□ 
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